首页> 外文OA文献 >Controlling false discoveries in high-dimensional situations: Boosting with stability selection
【2h】

Controlling false discoveries in high-dimensional situations: Boosting with stability selection

机译:控制高维情况下的错误发现:提升   稳定性选择

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Modern biotechnologies often result in high-dimensional data sets with muchmore variables than observations (n $\ll$ p). These data sets pose newchallenges to statistical analysis: Variable selection becomes one of the mostimportant tasks in this setting. We assess the recently proposed flexibleframework for variable selection called stability selection. By the use ofresampling procedures, stability selection adds a finite sample error controlto high-dimensional variable selection procedures such as Lasso or boosting. Weconsider the combination of boosting and stability selection and presentresults from a detailed simulation study that provides insights into theusefulness of this combination. Limitations are discussed and guidance on thespecification and tuning of stability selection is given. The interpretation ofthe used error bounds is elaborated and insights for practical data analysisare given. The results will be used to detect differentially expressedphenotype measurements in patients with autism spectrum disorders. All methodsare implemented in the freely available R package stabs.
机译:现代生物技术通常会产生高维数据集,其变量远多于观测值(n $ \ ll $ p)。这些数据集对统计分析提出了新的挑战:变量选择成为此设置中最重要的任务之一。我们评估最近提出的用于可变选择的灵活框架,称为稳定性选择。通过使用重采样程序,稳定性选择将有限采样误差控制添加到高维变量选择程序(如套索或增强)。我们考虑了升压和稳定性选择的组合,并通过详细的仿真研究得出了结果,该研究提供了对该组合有用性的见解。讨论了局限性,并给出了稳定性选择的规范和调整的指南。详细阐述了所使用的误差范围,并给出了实用数据分析的见解。该结果将用于检测自闭症谱系障碍患者的差异表达表型。所有方法均在免费提供的R包中实施。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号